Chad Bailey from the Pipecat team walks through what's possible with the new Gemini 3 multimodal real-time model: flight search, lodging lookup, Google Search grounding, trip report generation, and a language tutor agent, all in a single voice conversation.
Note: The public string for this model is gemini-3.1-flash-live. The string used in the video is for the Early Access Partner program and is now turned down.
What's covered: Scaffolding a bot with the Pipecat CLI, configuring Gemini 3 with minimal thinking for lower latency, writing system prompts that hold up across long conversations, defining and registering tool calls, enabling Google Search grounding, saving trip reports to disk, and running multiple agents in a single bot file with Pipecat Agents.
What are you building with Gemini Live API? Drop it in the comments.
Resources:
Gemini Live API overview →
Get started at pipecat.ai →
Pipecat examples →
Subscribe to Google for Developers →
Speaker: Chad Bailey from the Pipecat
Products Mentioned: Google AI, Gemini
|
Chad Bailey from the Pipecat team walks ...
Build full-stack applications in minutes...
Download your free Python Cheat Sheet he...
Can you spot the bug in this JavaScript ...
Download your free Python Cheat Sheet he...
本日はCoworkでAI秘書爆誕についてお話させて頂きました! ぜひご視聴くださ...
Download your free Python Cheat Sheet he...
🔥AI-Powered Digital Marketing Certificat...
AWS Security Hub Extended: Full-Stack En...
Gemini 3.1 Flash Live lets you build age...